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Abstract 

The computational complexity of the Maximum Likelihood decoding algorithm in [1], [2] for orthogonal 
space-time block codes is smaller than specified. 

I. Introduction 

In [1],[2], the decoding of an Orthogonal Space-Time Block Code (OSTBC) in a Multi-Input Multi- 
Output (MIMO) system with N transmit and M receive antennas, and an interval of T symbols during 
which the channel is constant, is considered. The received signal is given by 



Y = Q N H + V 



(1) 



where Y = [yi]rxM is the received signal matrix of size T x M and whose entry y{ is the signal 
received at antenna j at time t, t = 1, 2, . . . , T, j = 1, 2 . . . , M; V = [v 



t]rxM 



is the noise matrix, and 



Qn = [glWxN is the transmitted signal matrix whose entry g\ is the signal transmitted at antenna i at time 
t, t = 1, 2, . . . , N. The matrix H = [/ijj]jVxM is me channel coefficient matrix of size N x M whose entry 
hij is the channel coefficient from transmit antenna i to receive antenna j. The entries of the matrices H 
and V are independent, zero-mean, and circularly symmetric complex Gaussian random variables. 

The real-valued representation of (QQ) is obtained by first arranging the matrices Y, H, and V, each in 
one column vector by stacking their columns one after the other as 



(2) 



where Qn = hi <8> Qn, with lu denoting the identity matrix of size M and ® denoting the Kronecker 
matrix multiplication, and then decomposing the MT-dimensional complex problem defined by © to a 
2MT-dimensional real-valued problem by applying the real-valued lattice representation defined in [3] to 
obtain 

Hx + v (3) 
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or equivalently 
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(4) 



The real-valued fading coefficients of H are defined using the complex fading coefficients h iy j from 



transmit antenna i to receive antenna j as h 



2l-l+2(J-l)N 



Re(hij) and h 



2Z+2(i-l)JV 



Im(hij) for 
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I = 1,2, ...,N and j = 1,2, ... ,M. Since Gn is an orthogonal matrix and due to the real- valued 
representation of the system using ©, it can be observed that 

• All columns of H = [hih 2 . . . h 2 K\ where hi is the iih. column of H, are orthogonal to each other, 
or equivalently 

(5) 
(6) 



hfh j = i,j = l,2,...,K,i^j 
• The inner product of every column in H with itself is equal to a constant, i.e., 



hjhi = hjhj 



i,J = 1,2, 
II. Decoding 



Let 

Note a = hjhi, i — 1,2, 



a = h\hx. 

2K. Due to the orthogonality property in ©-©, we have 



H'H = al 



2K- 



Let's represent © as 



(7) 

(8) 
(9) 



(10) 
(11) 



y = Hx + v. 

By multiplying this expression by H T on the left, we have 

V = H T y 
= ax + v 

where v is zero-mean, and due to ([8]) has independent and identically distributed Gaussian members. The 
Maximum Likelihood solution is found by minimizing 

2 



(12) 









X\ 


















— a 








. V2K . 




. %2K . 





over all combinations of x G Or . This can be further simplified as 

Xi = argmin — aXi\ 2 

for i = 1, 2, . . . , 2K. Then, the decoded message is 

x = {x x ,x 2 , . . .,x 2 k) T - 



(13) 



(14) 



III. Computational Complexity 

The decoding operation consists of the multiplication H T y, calculation of a = hjhi, the multiplica- 
tions axi, and performing (fT3l) . With a slight change, we will consider the calculation of a^ 1 and the 
multiplications 

z i = (j- 1 y i i = l,2,...,2K. (15) 



Then 



x i = arg mm | Zi — x i 

Xi£fl 



(16) 



for i = 1,2, . . . ,2K, which is a standard quantization operation in conventional Quadrature Amplitude 
Modulation. We will compute the decoding complexity up to this quantization operation. Note H is a 
2MT x 2K matrix, hi is a 2MT-dimensional vector, and we will assume the complexity of real division 
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as equivalent to 4 real multiplications as in [1],[2]. The multiplication H T y takes 2MT ■ 2K, calculation 
of a takes 2MT, its inverse takes 4, and <r~ l y takes 2K real multiplications. Similarly, the multiplication 
H T y takes 2K ■ (2MT - 1), and calculation of a takes 2MT - 1 real additions. Letting R M and R A 
be the number of real multiplications and the number of real additions, the complexity of decoding the 
transmitted complex signal (si, s 2 , • • • , sk) with the technique described in ©, (fiOT) . and (fl"5l) is 

C PR = (AKMT + 2MT + + 4)i2 M , (AKMT + 2MT -2K- 1)R A (17) 

which is smaller than the complexity specified in [1],[2] and does not depend on the constellation size L. 
However, as will be seen in the examples, the matrix H can include values identical to 0, or multiplications 
by a scalar, which result in deviations from (fTTT) . Also, in (l56l) . we will provide a slightly smaller figure 
for this complexity. In what follows, we will calculate the exact complexity values for four examples. 
Due to the orthogonality property © of H, the QR decomposition of H is 

Q = -7=6 R = ^I 2K (18) 

and therefore does not need to be computed explicitly. The procedure described above is equivalent and 
has lower computational complexity. 



IV. Comparison with a Conventional Technique 
We will now compare the technique in ©, (flOl) . and (fl"5l) with one from the literature. In [4], it has 



been shown that 



where 



K 



\Y - G N H\\ 2 = \\H\\ 2 ^2\s k - s k \ 2 + constant, 



k=l 



Sk 



\H\ 



[Re{Tr(H H A»Y)} - i ■ lm{Tr(H B^Y)}} 



(19) 



(20) 



and where and Bk are the matrices in the linear representation of Qn in terms of Sk = Re[sfc] and 
s k = Im[sfc] for k — 1, 2, . . . , K as [4] 



K 



K 



Qn = 2j s k A k + iskB k = s k A k + s* k B k 



(21) 



k=l 



k=l 



i = a/— T, Ak = Ak + Bk, and B k = A k — Bk- Once {§k}k=i are calculated, the decoding problem can 
be solved by 



mm \s k - s k \ 



(22) 



once for each k = 1, 2, . . . , K. Similarly to (fT6l) , this is a standard quantization problem in Quadrature 
Amplitude Modulation and we will calculate the computational complexity of this approach up to this 
point. 

We will carry out the computational complexity analysis of the technique in CD), (flOl) . and (fl"5l) against 
the complexity of the technique in (|20l ) for four examples, including those in [1], [2]. 
Example 1: Consider the Alamouti OSTBC with N = K = T = 2mdM = l where 



si s 2 

5 2 ftj 



The received signal is given by 



Vi 

2/2 



So 5i 



h 2 ,i 



+ 



>'2 



(23) 



(24) 
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Representing (|24l) in the real domain, we have 



" Re(yi) " 




x x 




" Re(>i) " 


Im(yi) 


= H 




+ 


Im(f i) 




X3 


Re(v 2 ) 


_ Im(2/ 2 ) _ 




x 4 




Im(v 2 ) 



(25) 



where x\ = Re(si), x 2 = Im(si 



H 



Re(s 2 ): 

hi 
h 2 
h 3 
h/i 



Xi = Im(s2) and 



-h 2 h 3 
hi 

h 4 —hi 

-h 3 -h 2 



—hi 

h 3 
-h 2 

hi 



(26) 



Note that the matrix H is orthogonal and all of its columns have the same squared norm. One needs 16 real 
multiplications to calculate y = H T y, 4 real multiplications to calculate a = h\h\, 4 real multiplications 

r 1 'i n r\ A roil tnn1fir\1ir>ofir«tio i r\ r>o1r>n1nfa /t 



to calculate a~ , and 4 real multiplications to calculate a~ y. There are 3 • 4 = 12 real additions to 
calculate H T y and 3 real additions to calculate a. As a result, with this approach, decoding takes a total 
of 28 real multiplications and 15 real additions. 
For the method in (|20l ) above, the products H H A^, 

H B A? = [h^ 1 h*^ 
H H A*=[h* 2>1 -h* hl 

which will be multiplied by Y = (yi,y 2 ) T where hx y 



, H H B±, 

H H B* -- 
H H B 2 

,1-, h 2 \ 



H H A 2 , 

- [ K,i 
= [ h 2 A 



H H B 2 are 

-K,i ] 
h*i,i] 



(27) 



yx, y 2 are all complex. It can be observed 
from (|20l ) and (1271 ) that one needs all products h^yj, i,j = 1,2. Therefore, one needs 4 complex or 
16 real multiplications. The calculation of \\H\\ 2 takes 4, its reciprocal l/||if || 2 4, and the multiplication 
of 1/||#|| 2 with Re{Tr[H H A^Y}} and Im{Tr[H H B* Y)} for k = 1,2 another 4 real multiplications. It 
can be calculated that each of Re{Tr[H H A% Y]} and lm{Ti[H H Bj^Y]} has 3 distinct real additions for 
k = 1,2, which means there are a total of 12 real additions for this operation. Calculation of \\H\\ 2 takes 3 
real additions. As a result, this approach employs 28 real multiplications and 15 real additions to decode. 

Note, in this case, the complexity figures in (fTTT) are 28 real multiplications and 15 real additions, which 
hold exactly. 

Example 2: Consider the OSTBC with M = 2, JV = 3, T = 8, and K 



si s 2 

52 Si 

53 S4 



S3 
Si 
Si 



S4 
-S 3 

S-2 



4 given by [5] 

-l T 



(28) 



The received signal can be written as 





v\- 




h,i 


hi, 2 










= Gs 


h 2 ,i 


h 2 ,2 


+ 




.vl 


vl. 




_ h 3A 


h 3 ,2 _ 




. 4 v 2 s . 



(29) 



In [2], it has been shown that the 32 x 8 real- valued channel matrix H is 

hi —h 2 /i 3 —h^ h§ —h% 
h 2 hi hi /13 h^ h$ 



H 



h 7 
hg 






-h 8 h 9 
h 7 hi 



-ho 
h 9 



hi 2 



-hi 2 
hn 








hi 
hi 2 



hi2 —hg —hio 
—hn —hio hg 






-h 7 
-h H 









-h 8 
h 7 



(30) 
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where hi, i — 1, 2, . . . , 11 and h~, j = 2, 4, 



12 are the real and imaginary parts, respectively, of 



h-2,1, h 3i i, hi :2 , h 2> 2, ^3,2- The matrix H T is 8 x 32 where each row has 8 zeros, while each of the remaining 
24 symbols has one of hi, hi, ■ ■ ■ , hi 2 , repeated twice. Let's first ignore the repetition of hi in a row. Then, 
the calculation of H T y takes 8 • 24 = 192 real multiplications. The calculation of a = h\hi = 2 YlkLi hi 
takes 12 + 1 = 13 real multiplications, In addition, one needs 4 real multiplications to calculate a -1 , and 
8 real multiplications to calculate <r~ l y. To calculate H T y, one needs 8 • 23 = 184 real additions, and to 
calculate a, one needs 11 real additions. As a result, with this approach, one needs a total of 217 real 
multiplications and 195 real additions to decode. 

For the method in (|20l) above, the products H H A^ and H H are 



H H A? 
H H B? = 



h* h* h* 
"l,l "2,1 "3,1 





h* 
"1,1 


h* 
"2,1 


Ki 


h* h* h* 

"l,2 "2,2 "3,2 





h* 
"1,2 


h* 
"2,2 


h( 2 


h* h* h* fl 
"l,l "2,1 "3,1 u 




■h* 
"1,1 


—h* 
"2,1 


-^3,1 


h* h* h* D 
"1,2 "2,2 "3,2 u 




■h* 
"1,2 


—h* 

"2,2 


— h* 

"3,2 








(31) 



Other H H A^ and H H have similar structures, with zero columns located elsewhere, same location 
in H H Af and H H B^, k = 2,3,4. Nonzero columns of H H A% and H H B% are the shuffled versions of 
the columns of H H A^ and H H B± , with the same shuffling for H H A^ and H H Bf? ', possibly with sign 
changes. As a result, the first four columns of H H A^ and H H Bjf are the same, the first and second four 
columns of H H A^ are the same, while the first and second four columns of H H B^ are negatives of each 
other, k — 1, 2, 3, 4. For this Q N , one has 



GnGn 



K 

E 

,fc=i 



Sk\ 



(32) 



which makes it necessary to replace with 2||fZ"|f in (1201) above. The vector Y is given as 



Y 



L y\ vl J 



(33) 



The complex multiplications in calculating Re{Tr[iJ H ylfF]} can be used to calculate lm{Ti[H H B^Y]} 
due to sign changes and the calculation of real and imaginary parts. Ignoring the repetition of h*,, there 
are 12 different complex numbers in H H A^ and due to the trace operation, they will be multiplied with 12 
complex numbers from Y. As a result, to calculate Tr[H H A^Y] (equivalently Tr[H H BjfY}) one needs 12 
complex or 48 real multiplications for one k. To calculate the numerators of sjt, for all k = 1,2, 3, 4, one 
needs 192 real multiplications. To calculate 2||if || 2 in the denominator, one needs 13 real multiplications. 
To calculate its inverse, one needs 4 real multiplications. Finally, to complete the calculation of s fc for 
k = 1,2,3,4 by multiplying the numerators of their real and imaginary parts by l/(2||iJ|| 2 ), one needs 
8 real multiplications. To calculate each Re{Tr[H H A% Y]} or lm{Tr[H H Bj* Y]} for k = 1,2,3,4, one 
needs 12 + 11 = 23 real additions. To calculate ||-£f|| 2 , one needs 11 additions. As a result, with this 
approach, one needs 217 real multiplications and 195 real additions to decode, same number as in the 
approach specified by ©, (flOl) . and (TT5T) . 

For this example, (flTT) specifies 300 real multiplications and 279 real additions. The reduction is due 
to the elements with zero values in H. 

It is important to make the observation that the repeated values of hi in the columns of H, or equivalently 
h* mn in the rows of H H A^ or H H Bj?, have a substantial impact on complexity. We will carry out the 
rest of this discussion only for the approach in ©, (flOl) . and (fT5l) . the one in (|2Q|) is essentially the same. 
Due to the repetition of hi, by grouping the two values of yj that it multiplies, it takes 8 • 12 = 96 real 
multiplications to compute H T y, not 8 • 24 = 192. The summations for each row of H T y will now be 
done in two steps, first 12 pairs of additions per each hi, and then after multiplication by hi, addition of 
12 real numbers. This takes 12 + 11 = 23 real additions, with no change from the way the calculation was 
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made without grouping. With this change, the complexity of decoding becomes 121 real multiplications 
and 195 real additions, a huge reduction from 300 real multiplications and 279 real additions. 

Example 3: We will now consider the code Q A from [5]. The parameters for this code are N — K — 4, 
M — 1, and T — 8. It is given as 



$1 —$2 —$3 ~ s 4 

5 2 Si S 4 -S3 

5 3 -S 4 Si S 2 

5 4 S 3 —S 2 Si 



-l T 



-«3 

~4 



SI 

-4 
4 
4 



(34) 



Similarly to Gs of Example 2, this code has the property that G 4 G± = 2($^fc=i l s fc| 2 K- As a result, \\H\\ 2 
in the denominator in (1201) should be replaced with 2||i7|| 2 . The matrix is 16 x 8 and can be calculated 

as 



H 



' h 


-h 2 


hs 


—h 4 


h 5 


-h e 


hr 


h 8 


h 2 


h x 


hi 


h 3 


h 6 


h 5 


hs 


h 7 


h 3 


— h 4 




h 2 


h 7 


-h 8 


-h 5 


h e 


h 4 


h 3 


-h 2 


-h 


hg 


h 7 


-he 


-h 5 


h 5 


h 6 


-h 7 


h 8 


-h 


-h 2 


hs 


h 4 


. h 


-hs 


-h 8 


h 7 


-h 2 




hi 


-hs . 



(35) 



This matrix consists entirely of nonzero entries. Each entry in a column equals ±h for some i G 
{1,2, ...,8}, every hi appearing twice in a column. Ignoring this repetition for now, calculation of 
H T y takes 8 • 16 = 128 real multiplications. Calculation of a takes 9 real multiplications, its inverse 
4 real multiplications, and the calculation of o~ l y takes 8 real multiplications. Calculation of H T y takes 
8 • 15 = 120 real additions, and calculation of a takes 7 real additions. As a result, with this approach, to 
decode, one needs 149 real multiplications and 127 real additions. 

For this code, for the method in (|20l ), the matrices H H A? and H H Bff k — 1, 2, 3, 4 are as follows. 



H H A? = 


[ "1,1 


h* 
"2,1 


h* h* h* h* 

U S,1 "-4,1 "'1,1 u 2,l 


h* 
n 3,l 


K,i } 




H H A» = 


r h* 


— h* 

'n,i 


—h* h* h* 

"i,l "3,1 "2,1 


— h* 
"l,l 


-hli 


h* 


H H A% = 


\ h* 


h* 


— h* —h* h* 
"1,1 "2,1 "3,1 


h* 


— h* 
n l,l 


— h* 

a 2,l 


H H Af = 


\ h* 


— h* 


h* —h* h* 
"2,1 "l,l 'Mjl 


—h* 

"3,1 


h* 
"2,1 


— h* 


H H B? = 


f h* 

i "1,1 


h* 
"2,1 


^3,1 ^4,1 —h\i — 


h* 


— h* 
>%1 


— h* 
"'4,1 


H H B% = 


f h* 
i "2,1 


— h* 

'n,i 


_A* h* —h* 
"i,l "3,1 "2,1 


h* 
"1,1 


h* 
"4,1 


h* 

IL 3,1 



H H B 



H 



H U H 



H B 



(36) 



*3,1 



h 



4,1 



h* 
"4,1 

-h 



3,1 



-h* 
"3,1 

-h* 
'''4,1 



-h 



4,1 



h* 
"3,1 



h* 
-h 



h 



2,1 



2.1 
h* 

"l,l 



-h* —h* 

lb l,l "2,1 

h* —h* 
"2,1 "l,l 

From this set we conclude that the complex multiplications between H H A^Y and H H B k HY can be shared 
for a given k = 1,2, 3, 4. The number of real multiplications to calculate H H A^Y for all k = 1, 2, 3, 4 is 
4 • 8 • 4 = 128. The number of real multiplications to calculate 2||i/|| 2 is 6 + 1 = 7, and to calculate its 
inverse takes 4 real multiplications. Finally, the number of real multiplications to complete the calculation 
of 5^ for all k = 1,2,3,4 is 8. In order to calculate H H A^Y or H H Bj^Y, one needs 8 real additions 
to perform each complex multiplication and 7 real additions to calculate the sum. As a result, calculation 
of Re{Ti[H H A^Y]} and Im{Tr[H H Bj?Y]} for all k = 1,2,3,4 takes 8 x 15 real additions. Calculation 
of \\H\\ 2 takes 6 + 1 = 7 real additions. Therefore, with this approach the number of real multiplications 
and additions to decode are 149 and 127, respectively, same as the numbers needed for the approach in 
©, COB, and COD). 
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For this example, equation (fT71) specifies 156 real multiplications and 135 real additions. The reduction is 
due to the fact that one row of H T has each hi appearing twice. This reduces the number of multiplications 
and summations to calculate a by about a factor of 2. 

However, because each hi appears twice in every row of H T , the number of multiplications can actually 
be reduced substantially, as we discussed in Example 2. As discussed in Example 2, we can reduce the 
number of multiplications to calculate H T y by grouping the two multipliers of each hi by summing them 
prior to multiplication by hi, i = 1, 2, . . . , 8. As seen in Example 2, this does not alter the number of 
real additions. With this simple change, the number of real multiplications to decode becomes 85 and the 
number of real additions to decode remains at 127. 

Example 4: It is instructive to consider the code 7i 3 given in [5] with A r = 3, K = 3, T = 4 which 
we will consider for M = 1 where 

S3/V2 
S3/V2 
-s 1 -s* 1 + s 2 -s*)/2 
(s 2 + s* 2 + si - s*)/2 



Sl 

—s* 



s 2 
g* 

s* 3 /V2 



(37) 



For this code, HSH.^ 



(El 



_ sl/V2 s* 3 /y/2 
Isfcl 2 )/ is satisfied. In this case, the matrix H can be calculated as 



H 



hi 


-h 2 


h 


—h 4 


h/V2 


-h 6 /V2 


h 2 


hi 


hi 


h 3 


h e /V2 


h/V2 


h 3 


hi 


-hi 


-h 2 


h/V2 


-he/y/2 




-hs 


-h 2 


hi 


K/V2 


h/V2 


-h 5 








-h% 


(hi + h 3 )/y/2 


(h 2 + h 4 )/V2 


-h 6 








h 5 


(h 2 + h 4 )/V2 


-(h 1+ h 3 )/V2 





he 


h 





(hi - h 3 )/V2 


(h 2 - h 4 )/V2 





-h 5 


h 6 





(h 2 - h 4 )/V2 


(-h + h)/^ 



(38) 



It can be verified that every column hi of H has the property that hf hi = a = \\H\\ 2 = ^\ =1 h\ for 
i = 1,2,. ..,6. In this case, the number of real multiplications to calculate H T y requires more caution 
than the previous examples. For the first four rows of H T , this number is 6 real multiplications per row. 
For the last two rows, due to combining, e.g., hi and /13 in (hi + h 3 )/^/2 in the fifth element of h 5 , and 
the commonality of h 5 and h 6 for the first and third, and second and fourth, respectively, elements of h 5 , 
and one single multiplier for the whole column, the number of real multiplications needed is 7. As 

a result, calculation of H T y takes 38 real multiplications. Calculation of a takes 6 real multiplications. 
One needs 4 real multiplications to calculate er _1 , and 6 real multiplications to calculate a~ l y. First four 
rows of H T y require 5 real additions each. Last two rows of H T y require 4 + 7 = 11 real additions each. 
This is a total of 42 real additions to calculate H T y. Calculation of a requires 5 real additions. Overall, 
with this approach one needs 54 real multiplications and 47 real additions to decode. 

For this code, for the method in (|20l) above, the matrices H H A^ and H H Bjf, k = 1, 2, 3 are as follows. 



H H A? = 


[hti h* 2 ^ -hi, 0] 


H H A» = 


[ hi, -hi, hi, ] 


H H A* = 


4=[ ^3,1 ^3,1 ^,1 + ^2,1 h*^ 


H H B? = 


\Ki -h* A h* 3> ,\ 


H H B2 = 


[hli h* h , hi, 0] 


H H B 3 H = 


[ h* h* —h* — h* — 
^[ u 3,l ""i,! 'n,i "-2,1 



h 



2.1 



(39) 



n, , -\- a 2 , 
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Before discussing the complexity of the approach in (1201 ), we would like to make an observation. 
A careful examination shows that the complex multiplications between H H A^Y for k = 1, 2, 3 and 
H H Bj I Y for j = 1,2,3 can be shared in the method outlined in (|20l ). In this case, since h% in the 
first and second element of H H A 3 can be shared, there are 9 complex multiplications needed for the 
calculation of H H A^Y for k = 1, 2, 3. The real values of those will be used in calculating the real parts 
of s k , k = 1, 2, 3, and the imaginary parts in calculating the imaginary parts of s*., k = 1, 2, 3, albeit in 
possibly different signs or locations. This requires a careful implementation where the needed complex 
multiplications are calculated, stored, and their real and imaginary parts carefully distributed in the most 
judicious manner. The 9 complex multiplications correspond to 36 real multiplications, and there are 2 
more real multiplications, by l/y/2 for the real and imaginary parts of s 3 . As in the previous method, 6 
real multiplications are needed to calculate ||f/|| 2 , 4 real multiplications to calculate l/||i/|| 2 , and 6 real 
multiplications to complete the calculation of s k , k = 1,2,3. The calculation of Re{Ti[H H Aj/Y}} and 
Im{Tr[H H Bj?Y]} for all k = 1, 2, 3 takes 4 ■ 5 + 2 • (6 + 5) = 42 real additions, and the calculation of 
|| if || 2 takes 5 more real additions. This approach results in a total of 54 real multiplications and 47 real 
additions to decode, as in the technique in CD), (flOl) . and (fl"5T) . 

For this example, (flTl) specifies 66 real multiplications and 49 real additions. The reduction is due to 
the presence of the zero entries in H. On the other hand, the presence of the factor 1 / y/2 in the last two 
rows of H T adds two real multiplications to the total number of real multiplications. 

Before concluding this example, we would like to display the matrices A 3 and B 3 for this code. 



A, 



l/y/2 

l/y/2 

l/y/2 l/y/2 

_ l/y/2 -l/y/2 



l/y/2 

l/y/2 

-l/y/2 -l/y/2 

-l/y/2 l/y/2 



(40) 



In all other A k and B k matrices in the four examples studied, the entries were ±1. Furthermore, in all 
other A k and B k matrices in the four examples, there was at most one nonzero value in a row. In both 
A 3 and B 3 above, the entries are irrational numbers and two rows have two nonzero entries. 

From the examples above, by studying the operations of the two techniques in detail, it can actually 
be seen that, not only is the computational complexity of the technique in ©, (fTTI) . and (fl~5T) is the same 
as the technique in (|20l) . but also they actually perform equivalent operations. 



V. Orthogonality of H and Computational Complexity Revisited 

We have seen in the examples that when Q^Qn = c (J2k=i \ s k\ 2 )I where c = 1,2, then a = c\\H\\ 2 . 
We will now show this holds in general. Based on that result, we will then reduce the computational 
complexity estimate in (fTTI) . 

Let 



vec(F) 



iv¥ J 



(41) 



Form two vectors, s and s, consisting of real and imaginary parts of s k , and form a vector s' that is the 
concatenation of s and s: 



S = (Si, S 2 , ■ ■ ■ , S K ) T , S = (si, S 2 , ■ ■ ■ , S K ) T , 

By rearranging the right hand side of ©, we can write 

z = Fs' + e = F a s + F b s + e 



[s,s) 



(42) 



(43) 



where F = [F a is an MT x 2K, and F a and Fb are MT x K complex matrices whose entries 
consist of (linear combinations of) channel coefficients h^j, and e is the corresponding complex Gaussian 
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noise vector. In [4], it was shown that when Q§Q N = (J2k=i \ s k\ 2 )L then Re[F H F] = \\H\\ 2 I. It is 
straightforward to extend this result so that when Q§Qn = c (J2k=i \ s k\ 2 )L then 

Re[F H F] = c\\Hfl 

where c is a positive integer. Let 

z = Re{z], z — lm[z), e = Re[e], e = Im[e], 

and 

F a = Re[F a ], F a = lm[F a ], F b = Re[F b ], F b = lm[F b ] 

Now define 



z 
z 



F' 



F a F b 
F a F b 



e 
e 



so that we can write 



y > = F's 1 + e' 



(44) 

(45) 
(46) 

(47) 
(48) 



which is actually the same expression as © except the vectors and matrices have their rows and columns 
permuted. 

It can be shown that (144b implies 

F' T F' = c\\H\\ 2 L 



(49) 



Let P y and P s be 2K x 2K and 2MT x 2MT, respectively, permutation matrices such that 



P,s'. 



- Re(yl) - 




' Re(si) " 


Imfj/i 1 ) 


= P y y' 


Im(si) 






Re(s^) 


. Mv¥) - 




. lm(s K ) . 



(50) 



It follows that P T P, = P. PT = I and P T P„ = P. PT = I. 



We now have 



y y y y 



Re(yl) 
lm(yl) 

L Mv¥) 



P y {F's' + e') 



(51) 



PyF'P 



Re(si) 
Im(si) 

Re{s K ) 
_ lm(s K ) _ 



P y e' 



(52) 



H 



' Re(si) " 
Im(si) 


+ 


' Re(vl) ' 
lm(v\) 


Re(s K ) 
_ kn(s K ) . 




Re«) 
. Im«) . 



(53) 



Therefore, 



H = PyF'Pj 



(54) 
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which implies 

H T H = P s F ,T pTp y F'Pj = c\\H\\ 2 L (55) 



In other words, a = c\\H\\ . This has an impact on the computational complexity formula (|T71) which we 
discuss next. 

First, let c = 1. Since a = \\H\\ 2 , its calculation takes 2MN real multiplications and 2MN — 1 real 
additions. As a result, the computational complexity formula (fTTT) can be updated as 

C PR = (AKMT + 2MN + 2K + 4)R M , (AKMT + 2MN -2K-1)R A . (56) 

When c > 1, the number of real multiplications to calculate a increases by 1, however, the complexity 
of the calculation of H T y will reduce by a factor of c, as seen in the examples. 

As seen in the examples, the presence of values of within H will reduce the computational complexity. 
Its effect will be a reduction in the number of real multiplications to calculate H T y by a factor equal to 
the ratio of the rows of A k and B k that consist only of values to the total number of all rows in A k and 
B k for k = 1, 2 . . . , K, with a similar (not same) reduction in the number of real additions to calculate 
H T y. It will also reduce the number of real multiplications and additions to calculate a but that effect can 
be more complicated, as seen in Example 4. Also, as seen in Example 4, the contents of the H matrix 
can have linear combinations of hi values, which also result in changes in computational complexity. 

VI. Discussion 

For an OSTBC Gn satisfying G§G = c(J2 k =i \ s k\\ 2 )I where c is a positive integer, the Maximum 
Likelihood solution is formulated in four equivalent ways 

\\Y - GnH\\ 2 = \\z - Fs'\\ 2 = \\z' - F's'\\ 2 = \\y - Hx\\ 2 . (57) 

There are four solutions, all equal. The first solution is obtained by expanding \\Y — GnH\\ 2 and is given 
by ([20]) when c = 1 [4, eq. (7.4.2)]. When c> 1, it should be altered as 

S * = ~ulF^l Re ^ HL lA " Y )} - 1 ■ Im{Tr(#*5f Y)}} k = 1, 2, . . . , K. (58) 
c \\H r 



(59) 



The second solution is obtained by expanding the second expression in (1571) and is given by 

A/ _ Re[F H z] 
S ~ c||#|| 2 " 

This is given in [4. eq. (7.4.20)] for c = 1. The third solution is the solution to the third equation in (1571) 

c\\H r 



The fourth solution is the one introduced in [1]. It is the solution to the fourth equation in (|571) and is 
given by 



Re(si) 
Im(si) 

Re(s K ) 
hn(s K ) 



a c \\H r 



Considering that 

F a = [vec(HAi) ■ ■ ■ vec(HA K )] F b = [ivec(HBi) ■ ■ ■ ivec(HB K )] (62) 



[4, eq. (7.1.7)], it can be verified that (1581) and (1591) are equal. The equality of (1591) and (|60l) follows from 
(|45l> -(|47 l) . The equality of d60j) and dHB follows from (|50]) and ([54b . Therefore, equations (|58T) - (|6l1) yield 
the same result, and when properly implemented, will have identical computational complexity. 
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Finally, we would like to state that a straightforward implementation of (|58l ) or (|59l ) can actually result 
in larger complexity than (|60l ) and (|6T| ). The proper implementation requires that in (|58l ) and (1591) , the 
terms not needed due to elimination by the Tr[ ], Re[ ], and Im[ ] operators are not calculated. We 
calculated the computational complexity values for the examples taking this fact into account. 
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